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IDENTIFICATION OF BROADLY REACTIVE DR RESTRICTED EPITOPES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application is a continuation in part of USSN 60/036,713, filed 
January 23, 1997 and 60/037,432 filed February 7, 1997, both of which are incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 

Helper T lymphocytes (HTL) play several important functions in immunity 
to pathogens. Firstly, they provide help for induction of both CTL and antibody 
responses. By both direct contact and by secreting lymphokines such as IL2 and IL4, 
HTL promote and support the expansion and differentiation of T and B cell precursors into 
effector cells. In addition, HTL can also be effectors in their own right, an activity also 
mediated by direct cell contact and secretion of lymphokines, such as IFNy and TNFa. 
HTL have been shown to have direct effector activity in case of tumors, as well as viral, 
bacterial, parasitic, and fungal infections. 

HTL recognize a complex formed between Class II MHC molecules and 
antigenic peptides, usually between 10 and 20 residues long, and with an average size of 
between 13 and 16 amino acids. Peptide-Class II interactions have been analyzed in 
detail, both at the structural and functional level, and peptide motifs specific for various 
human and mouse Class II molecules have been proposed. 

In the last few years, epitope based vaccines have received considerable 
attention as a possible mean to develop novel prophylactic vaccines and immunotherapeutic 
strategies. Selection of appropriate T and B cell epitopes should allow to focus the 
immune system toward conserved epitopes of pathogens which are characterized by high 
sequence variability (such as HIV, HCV and Malaria). 

In addition, focusing the immune response towards selected determinants 
could be of value in the case of various chronic viral diseases and cancer, where T cells 
directed against the immunodominant epitopes might have been inactivated while T cells 
specific for subdominant epitopes might have escaped T cell tolerance. The use of epitope 
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based vaccines also allows to avoid "suppressive" T cell determinants which induce TH 2 
responses, in conditions where a TU X response is desirable, or vice versa. 

Finally, epitope based vaccines also offer the opportunity to include in the 
vaccine construct epitopes that have been engineered to modulate their potency, either by 
increasing MHC binding affinity, or by alteration of its TCR contact residues, or both. 
Inclusion of completely synthetic non-natural or generically unrelated to the pathogen 
epitopes (such as TT derived "universal" epitopes), also represents a possible mean of 
modulating the HTL response toward a TH^ or TH 2 phenotype. 

Once appropriate epitope determinants have been defined, they can be 
assorted and delivered by various means, which include lipopeptides, viral delivery 
vectors, particles of viral or synthetic origin, naked or particle absorbed cDNA. 

However, before appropriate epitopes can be defined, one major obstacle 
has to be overcome, namely the very high degree of polymorphism of the MHC molecules 
expressed in the human population. In fact, more than two hundred different types of 
HLA Class I and Class II molecules have already been identified. It has been 
demonstrated that in the case of HLA Class I molecules, peptides capable of binding 
several different HLA Class I molecules can be identified. Over 60% of the known HLA 
Class I molecules can, in fact, be grouped in four broad HLA supertypes, characterized by 
similar peptide binding specificities (HLA supermotif s) . 

In the case of Class III molecules, it is also known that peptides capable of 
binding multiple HLA types and of being immunogenic in the context of different HLA 
molecules do indeed exist. Until now, however, a general method for their identification 
has not been developed, probably at least in part a reflection of the fact that quantitative 
DR binding assays are labor intensive and that a large number of alleles must to be 
considered. 

The present invention addresses these and other needs. 



WO 98/32456 3 PCT/US98/01373 

SUMMARY OF THE INVENTION 

The present invention is based, at least in part, on the discovery and 
validation of specific motifs and assay systems for various DR molecules, representative of 
the worldwide population. Their application to the identification of broadly degenerate 
HLA Class II binding peptides is also described. 

Definitions 

The term "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other typically by peptide bonds between the alpha-amino and carbonyl groups 
of adjacent amino acids. The oligopeptides of the invention are less than about 50 residues 
in length and usually consist of between about 10 and about 30 residues, more usually 
between about 12 and 25, and often between about 15 and about 20 residues. 

An "immunogenic peptide" is a peptide which comprises an allele-specific 
motif such that the peptide will bind an MHC molecule and induce a HTL response. 
Immunogenic peptides of the invention are capable of binding to an appropriate HLA 
molecule and inducing HTL response against the antigen from which the immunogenic 
peptide is derived. 

A "conserved residue" is a conserved amino acid occupying a particular 
position in a peptide motif typically one where the MHC structure may provide a contact 
point with the immunogenic peptide. One to three, typically two, conserved residues 
within a peptide of defined length defines a motif for an immunogenic peptide. These 
residues are typically in close contact with the peptide binding groove, with their side 
chains buried in specific pockets of the groove itself. 

The term "motif" refers to the pattern of residues of defined length, usually 
between about 8 to about 11 amino acids, which is recognized by a particular MHC allele. 

The term "supermotif" refers to motifs that, when present in an 
immunogenic peptide, allow the peptide to bind more than one HLA antigen. The 
supermotif preferably is recognized by at least one HLA allele having a wide distribution 
in the human population, preferably recognized by at least two alleles, more preferably 
recognized by at least three alleles, and most preferably recognized by more than three 
alleles. 
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The phrases "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found in 
its native state. Thus, the peptides of this invention do not contain materials normally 
associated with their in situ environment, e.g., MHC I molecules on antigen presenting 
cells. Even where a protein has been isolated to a homogenous or dominant band, there 
are trace contaminants in the range of 5-10% of native protein which co-purify with the 
desired protein. Isolated peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino acid mimetic 
incorporated in an oligopeptide by an amide bond or amide bond mimetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR4w4 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2A presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR1 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2B presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR7 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention relates to compositions and methods for preventing, 
treating or diagnosing a number of pathological states such as viral, fungal, bacterial and 
parasitic diseases and cancers. In particular, it provides novel peptides capable of binding 
selected major histocompatibility complex (MHC) class II molecules and inducing an 
immune response. 

Peptide binding to MHC molecules is determined by the allelic type of the 
MHC molecule and the amino acid sequence of the peptide. MHC class I-binding peptides 
usually contain within their sequence two conserved ("anchor") residues that interact with 
corresponding binding pockets in the MHC molecule. Specific combination of anchor 
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residues (usually referred to as "MHC motifs") required for binding by several allelic 
forms of human MHC (HLA, histocompatibility leukocyte antigens) are described in 
International Applications WO 94/03205 and WO 94/20127. Definition of specific MHC 
motifs allows one to predict from the amino acid sequence of an individual protein, which 
5 peptides have the potential of being immunogenic for CTL. These applications describe 

methods for preparation and use of immunogenic peptides in the treatment of disease. The 
peptides described here can also be used as helper T peptides in combination with peptides 
which induce a CTL response. This is described in WO 95/07077. 

The DR-binding peptides of the present invention or nucleic acids encoding 

10 them can be administered to mammals, particularly humans, for prophylactic and/or 

therapeutic purposes. The DR peptides can be used to enhance immune responses against 
other immunogens administered with the peptides, when the peptides of the invention are 
used as helper peptides. For instance, mixtures of peptides of the invention in 
combination with peptides that induce CTL responses may be used to treat and/or prevent 

15 viral infection and cancer. Alternatively, immunogens which induce antibody responses 
can be used. Examples of diseases which can be treated using the immunogenic mixtures 
of DR peptides and other immunogens include prostate cancer, hepatitis B, hepatitis C, 
AIDS, renal carcinoma, cervical carcinoma, lymphoma, CMV and condyloma 
acuminatum. 

20 The DR-binding peptides or nucleic acids encoding them may also be used 

to treat a variety of conditions involving unwanted T cell reactivity. Examples of diseases 
which can be treated using DR-binding peptides include autoimmune diseases (e.g., 
rheumatoid arthritis, multiple sclerosis, and myasthenia gravis), allograft rejection, 
allergies (e.g., pollen allergies), lyme disease, hepatitis, LCMV, poststreptococcal 

25 endocarditis, or glomerulonephritis, and food hypersensitivities. 

In therapeutic applications, the immunogenic compositions or the DR- 
binding peptides or nucleic acids of the invention are administered to an individual already 
suffering from cancer, autoimmune disease, or infected with the virus of interest. Those 
in the incubation phase or the acute phase of the disease may be treated with the DR- 

30 binding peptides or immunogenic conjugates separately or in conjunction with other 
treatments, as appropriate. 
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In therapeutic applications, compositions comprising immunogenic 
compositions are administered to a patient in an amount sufficient to elicit an effective 
immune response to the virus or tumor antigen and to cure or at least partially arrest 
symptoms and/or complications. Similarly, compositions comprising DR-binding peptides 
5 are administered in an amount sufficient to cure or at least partially arrest the symptoms of 
the disease and its complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on, e.g., the 
peptide composition, the manner of administration, the stage and severity of the disease 
being treated, the weight and general state of health of the patient, and the judgment of the 

10 prescribing physician. 

Therapeutically effective amounts of the immunogenic compositions of the 
present invention generally range for the initial immunization (that is for therapeutic or 
prophylactic administration) from about 1.0 fig to about 10,000 fig of peptide for a 70 kg 
patient, usually from about 100 to about 8000 fig, and preferably between about 200 and 

15 about 6000 fig. These doses are followed by boosting dosages of from about 1.0 fig to 

about 1000 fig of peptide pursuant to a boosting regimen over weeks to months depending 
upon the patient's response and condition by measuring specific immunogenic activity in 
the patient's blood. 

It must be kept in mind that the compositions of the present invention may 

20 generally be employed in serious disease states, that is, life-threatening or potentially life- 
threatening situations. In such cases, in view of the minimization of extraneous substances 
and the relative nontoxic nature of the conjugates, it is possible and may be felt desirable 
by the treating physician to administer substantial excesses of these compositions. 

For prophylactic use, administration should be given to risk groups. For 

25 example, protection against malaria, hepatitis, or AIDS may be accomplished by 

prophylactically administering compositions of the invention, thereby increasing immune 
capacity. Therapeutic administration may begin at the first sign of disease or the detection 
or surgical removal of tumors or shortly after diagnosis in the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for 

30 a period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 
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Treatment of an infected individual with the compositions of the invention 
may hasten resolution of the infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic infection the compositions 
are particularly useful in methods for preventing the evolution from acute to chronic 

5 infection. Where the susceptible individuals are identified prior to or during infection, for 
instance, as described herein, the composition can be targeted to them, minimizing need 
for administration to a larger population. 

The peptide mixtures or conjugates can also be used for the treatment of 
chronic infection and to stimulate the immune system to eliminate virus-infected cells in 

10 carriers. It is important to provide an amount of immuno-potentiating peptide in a 

formulation and mode of administration sufficient to effectively stimulate a cytotoxic T 
cell response. Thus, for treatment of chronic infection, a representative dose is in the 
range of about 1.0 /xg to about 5000 /*g, preferably about 5 \i% to 1000 /*g for a 70 kg 
patient per dose. Immunizing doses followed by boosting doses at established intervals, 

15 e.g., from one to four weeks, may be required, possibly for a prolonged period of time to 
effectively immunize an individual. In the case of chronic infection, administration should 
continue until at least clinical symptoms or laboratory tests indicate that the viral infection 
has been eliminated or substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic or prophylactic treatment 

20 are intended for parenteral, topical, oral or local administration. Typically, the 
pharmaceutical compositions are administered parenterally, e.g., intravenously, 
subcutaneously, intradermally, or intramuscularly. Because of the ease of administration, 
the vaccine compositions of the invention are particularly suitable for oral administration. 
Thus, the invention provides compositions for parenteral administration which comprise a 

25 solution of the peptides or conjugates dissolved or suspended in an acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, 
buffered water, 0.9% saline, 0.3% glycine, hyaluronic acid and the like. These 
compositions may be sterilized by conventional, well known sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or 

30 lyophilized, the lyophilized preparation being combined with a sterile solution prior to 
administration. The compositions may contain pharmaceutically acceptable auxiliary 
substances as required to approximate physiological conditions, such as pH adjusting and 
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buffering agents, tonicity adjusting agents, wetting agents and the like, for example, 
sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, 
sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of DR and/or CTL stimulatory peptides of the invention 
5 in the pharmaceutical formulations can vary widely, i.e. , from less than about 0.1%, 

usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be 
selected primarily by fluid volumes, viscosities, etc., in accordance with the particular 
mode of administration selected. 

The peptides and conjugates of the invention may also be administered via 

10 liposomes, which serve to target the conjugates to a particular tissue, such as lymphoid 
tissue, or targeted selectively to infected cells, as well as increase the half-life of the 
peptide composition. Liposomes include emulsions, foams, micelles, insoluble 
monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the like. In 
these preparations the peptide to be delivered is incorporated as part of a liposome, alone 

15 or in conjunction with a molecule which binds to, e.g. , a receptor prevalent among 

lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, or with 
other therapeutic or immunogenic compositions. Thus, liposomes filled with a desired 
peptide or conjugate of the invention can be directed to the site of lymphoid cells, where 
the liposomes then deliver the selected therapeutic/ immunogenic peptide compositions. 

20 Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as 
cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome 
size, acid lability and stability of the liposomes in the blood stream. A variety of methods 
are available for preparing liposomes, as described in, e.g., Szoka, et ah, Ann. Rev. 

25 Biophys. Bioeng. 9, 467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 
5,019,369, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be incorporated into the 
liposome can include, e.g., antibodies or fragments thereof specific for cell surface 
determinants of the desired immune system cells. A liposome suspension containing a 

30 peptide or conjugate may be administered intravenously, locally, topically, etc. in a dose 
which varies according to, inter alia, the manner of administration, the conjugate being 
delivered, and the stage of the disease being treated. 
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Alternatively, DNA or RNA encoding one or more DR peptides and a 
polypeptide containing one or more CTL epitopes or antibody inducing epitopes may be 
introduced into patients to obtain an immune response to the polypeptides which the 
nucleic acid encodes. Wolff, et. ah, Science 247: 1465-1468 (1990) describes the use of 
5 nucleic acids to produce expression of the genes which the nucleic acids encode. Such use 
is also disclosed in U.S. Patent Nos. 5,580,859 and 5,589,466. The nucleic acids can also 
be administered using ballistic delivery as described, for instance, in U.S. Patent No. 
5,204,253. Particles comprised solely of DNA can be administered. Alternatively, DNA 
can be adhered to particles, such as gold particles. The nucleci acids can also be delivered 

10 complexed to cationic compounds, such as cationic lipids. Lipid-mediated gene delivery 
methods are described, for instance, in WO 96/18372; WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; WO 
91/06309; and Feigner et al (1987) Proc. Natl. Acad. Sci. USA 84: 7413-7414. The 
peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia 

15 or fowlpox. This approach involves the use of vaccinia virus as a vector to express 

nucleotide sequences that encode the peptides of the invention. Upon introduction into an 
acutely or chronically infected host or into a noninfected host, the recombinant vaccinia 
virus expresses the immunogenic peptide, and thereby elicits a host CTL response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

20 Patent No. 4,722,848, incorporated herein by reference. Another vector is BCG (Bacille 
Calmette Guerin). BCG vectors are described in Stover et al. ( Nature 351:456-460 
(1991)) which is incorporated herein by reference. A wide variety of other vectors useful 
for therapeutic administration or immunization of the peptides of the invention, e.g., 
Salmonella typhi vectors and the like, will be apparent to those skilled in the art from the 

25 description herein. 

A preferred means of administering nucleic acids encoding the peptides of 
the invention uses minigene constructs encoding multiple peptides of the invention along 
with CTL inducing peptides. To create a DNA sequence encoding the selected DR 
peptides and CTL epitopes for expression in human cells, the amino acid sequences of the 

30 epitopes are reverse translated. A human codon usage table is used to guide the codon 
choice for each amino acid. These epitope-encoding DNA sequences are directly 
adjoined, creating a continuous polypeptide sequence. To optimize expression and/or 
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immunogenicity, additional elements can be incorporated into the minigene design. 
Examples of amino acid sequence that could be reverse translated and included in the 
minigene sequence include: DR peptides of the invention, a leader (signal) sequence, one 
or more CTL epitope, and an endoplasmic reticulum retention signal. In addition, MHC 
5 presentation of CTL epitopes may be improved by including synthetic (e.g. poly-alanine) 
or naturally-occurring flanking sequences adjacent to the CTL epitopes. 

The minigene sequence is converted to DNA by assembling oligonucleotides 
that encode the plus and minus strands of the minigene. Overlapping oligonucleotides (30- 
100 bases long) are synthesized, phosphorylated, purified and annealed under appropriate 

10 conditions using well known techniques, he ends of the oligonucleotides are joined using 

T4 DNA ligase. This synthetic minigene, encoding the CTL epitope polypeptide, can then 
cloned into a desired expression vector. 

Standard regulatory sequences well known to those of skill in the art are 
included in the vector to ensure expression in the target cells. Several vector elements are 

15 required: a promoter with a down-stream cloning site for minigene insertion; a 
polyadenylation signal for efficient transcription termination; an E. coli origin of 
replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) promoter. See, U.S. Patent Nos. 5,580,859 and 5,589,466 for other suitable 

20 promoter sequences. 

Additional vector modifications may be desired to optimize minigene 
expression and immunogenicity. In some cases, introns are required for efficient gene 
expression, and one or more synthetic or naturally-occurring introns could be incorporated 
into the transcribed region of the minigene. The inclusion of mRNA stabilization 

25 sequences can also be considered for increasing minigene expression. It has recently been 
proposed that immunostimulatory sequences (ISSs or CpGs) play a role in the 
immunogenicity of DNA vaccines. These sequences could be included in the vector, 
outside the minigene coding sequence, if found to enhance immunogenicity. 

In some embodiments, a bicistronic expression vector, to allow production 

30 of the minigene-encoded epitopes and a second protein included to enhance or decrease 

immunogenicity can be used. Examples of proteins or polypeptides that could beneficially 
enhance the immune response if co-expressed include cytokines (e.g., IL2, IL12, GM- 
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CSF), cytokine-inducing molecules (e.g. LeIF) or costimulatory molecules. The HTL 
epitopes of the invention could be joined to intracellular targeting signals and expressed 
separately from the CTL epitopes. This would allow direction of the HTL epitopes to a 
cell compartment different than the CTL epitopes. If required, this could facilitate more 
5 efficient entry of HTL epitopes into the MHC class II pathway, thereby improving CTL 
induction. In contrast to CTL induction, specifically decreasing the immune response by 
co-expression of immunosuppressive molecules (e.g. TGF-p) may be beneficial in certain 
diseases. 

Once an expression vector is selected, the minigene is cloned into the 
10 poly linker region downstream of the promoter. This plasmid is transformed into an 
appropriate E. coli strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the minigene, as well as all other elements included in 
the vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial 
cells harboring the correct plasmid can be stored as a master cell bank and a working cell 
15 bank. 

Therapeutic quantities of plasmid DNA are produced by fermentation in E. 
coli, followed by purification. Aliquots from the working cell bank are used to inoculate 
fermentation medium (such as Terrific Broth), and grown to saturation in shaker flasks or 
a bioreactor according to well known techniques. Plasmid DNA can be purified using 

20 standard bioseparation technologies such as solid phase anion-exchange resins supplied by 
Quiagen. If required, supercoiled DNA can be isolated from the open circular and linear 
forms using gel electrophoresis or other methods. 

Purified plasmid DNA can be prepared for injection using a variety of 
formulations. The simplest of these is reconstitution of lyophilized DNA in sterile 

25 phosphate-buffer saline (PBS). A variety of methods have been described, and new 
techniques may become available. As noted above, nucleic acids are conveniently 
formulated with cationic lipids. In addition, glycolipids, fusogenic liposomes, peptides 
and compounds referred to collectively as protective, interactive, non-condensing (PINC) 
could also be complexed to purified plasmid DNA to influence variables such as stability, 

30 intramuscular dispersion, or trafficking to specific organs or cell types. 

Target cell sensitization can be used as a functional assay for expression and 
MHC class I presentation of minigene-encoded CTL epitopes. The plasmid DNA is 
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introduced into a mammalian cell line that is suitable as a target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow 
direct in vitro transfection. A plasmid expressing green fluorescent protein (GFP) can be 
5 co-transfected to allow enrichment of transfected cells using fluorescence activated cell 
sorting (FACS). These cells are then chromium-51 labeled and used as target cells for 
epitope-specific CTL lines. Cytolysis, detected by 51Cr release, indicates production of 
MHC presentation of minigene-encoded CTL epitopes. 

In vivo immunogenicity is a second approach for functional testing of 

10 minigene DNA formulations. Transgenic mice expressing appropriate human MHC 

molecules are immunized with the DNA product. The dose and route of administration 
are formulation dependent (e.g. IM for DNA in PBS, IP for lipid-complexed DNA). 
Twenty-one days after immunization, splenocytes are harvested and restimulated for 1 
week in the presence of peptides encoding each epitope being tested. These effector cells 

15 (CTLs) are assayed for cytolysis of peptide-loaded, chromium-51 labeled target cells using 
standard techniques. Lysis of target cells sensitized by MHC loading of peptides 
corresponding to minigene-encoded epitopes demonstrates DNA vaccine function for in 
vivo induction of CTLs. 

For solid compositions, conventional nontoxic solid carriers may be used 

20 which include, for example, pharmaceutical grades of mannitol, lactose, starch, 

magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic 
composition is formed by incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 

25 more conjugates of the invention, and more preferably at a concentration of 25% -75%. 

For aerosol administration, the peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of conjugates are 
0.01%-20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, 
and preferably soluble in the propellant. Representative of such agents are the esters or 

30 partial esters of fatty acids containing from 6 to 22 carbon atoms, such as caproic, 

octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an 
aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, such as mixed or 
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natural glycerides may be employed. The surfactant may constitute 0. 1 %-20% by weight 
of the composition, preferably 0.25-5%. The balance of the composition is ordinarily 
propellant. A carrier can also be included, as desired, as with, e.g., lecithin for intranasal 
delivery. 

5 In another aspect the present invention is directed to vaccines which contain 

as an active ingredient an immunogenically effective amount of an immunogenic DR 
peptide or a CTL\DR peptide conjugate or nucleic acid encoding them as described herein. 
The conjugate(s) may be introduced into a host, including humans, linked to its own 
carrier or as a homopolymer or heteropolymer of active peptide units. Such a polymer has 

10 the advantage of increased immunological reaction and, where different peptides are used 
to make up the polymer, the additional ability to induce antibodies and/or CTLs that react 
with different antigenic determinants of the virus or tumor cells. Useful carriers are well 
known in the art, and include, e.g., thyroglobulin, albumins such as bovine serum 
albumin, tetanus toxoid, polyamino acids such as poly (lysine: glutamic acid), hepatitis B 

15 virus core protein, hepatitis B virus recombinant vaccine and the like. The vaccines can 
also contain a physiologically tolerable (acceptable) diluent such as water, phosphate 
buffered saline, or saline, and further typically include an adjuvant. Adjuvants such as 
incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or alum are 
materials well known in the art. And, as mentioned above, CTL responses can be primed 

20 by conjugating peptides of the invention to lipids, such as P 3 CSS. Upon immunization 

with a peptide composition as described herein, via injection, aerosol, oral, transdermal or 
other route, the immune system of the host responds to the vaccine by producing large 
amounts of CTLs specific for the desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic infection. 

25 Vaccine compositions containing the DR peptides of the invention are 

administered to a patient susceptible to or otherwise at risk of disease, such as viral 
infection or cancer to elicit an immune response against the antigen and thus enhance the 
patient's own immune response capabilities, for instance with CTL epitopes described in 
**. Such an amount is defined to be an "immunogenically effective dose. " In this use, the 

30 precise amounts again depend on the patient's state of health and weight, the mode of 

administration, the nature of the formulation, etc., but generally range from about 1.0 fig 
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to about 5000 /xg per 70 kilogram patient, more commonly from about 10 /xg to about 500 
jig per 70 kg of body weight. 

In some instances it may be desirable to combine the peptide vaccines of the 
invention with vaccines which induce neutralizing antibody responses to the virus of 
5 interest, particularly to viral envelope antigens. For instance, PADRE peptides can be 
combined with hepatitis vaccines to increase potency or broaden population coverage. 
Suitable hepatitis vaccines that can be used in this manner include, Recombivax HB® 
(Merck) and Engerix-B (Smith-Kline). 

For therapeutic or immunization purposes, the peptides of the invention can 

10 also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach 
involves the use of vaccinia virus as a vector to express nucleotide sequences that encode 
the peptides of the invention. Upon introduction into an acutely or chronically infected 
host or into a non-infected host, the recombinant vaccinia virus expresses the immunogenic 
peptide, and thereby elicits a host CTL response. Vaccinia vectors and methods useful in 

15 immunization protocols are described in, e.g., U.S. Patent No. 4,722,848, incorporated 
herein by reference. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et aL, Nature 351, 456-460 (1991)) which is incorporated herein by 
reference. A wide variety of other vectors useful for therapeutic administration or 
immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, 

20 will be apparent to those skilled in the art from the description herein. 

Antigenic conjugates may be used to elicit CTL ex vivo, as well. The 
resulting CTL can be used to treat chronic infections (viral or bacterial) or tumors in 
patients that do not respond to other conventional forms of therapy, or will not respond to 
a peptide vaccine approach of therapy. Ex vivo CTL responses to a particular pathogen 

25 (infectious agent or tumor antigen) are induced by incubating in tissue culture the patient's 
CTL precursor cells (CTLp) together with a source of antigen-presenting cells (APC) and 
the appropriate immunogenic peptide. After an appropriate incubation time (typically 1-4 
weeks), in which the CTLp are activated and mature and expand into effector CTL, the 
cells are infused back into the patient, where they will destroy their specific target cell (an 

30 infected cell or a tumor cell). 

The peptides of this invention may also be used to make monoclonal 
antibodies. Such antibodies may be useful as potential diagnostic or therapeutic agents. 
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The peptides may also find use as diagnostic reagents. For example, a 
peptide of the invention may be used to determine the susceptibility of a particular 
individual to a treatment regimen which employs the peptide or related peptides, and thus 
may be helpful in modifying an existing treatment protocol or in determining a prognosis 
5 for an affected individual. In addition, the peptides may also be used to predict which 
individuals will be at substantial risk for developing chronic infection. 

Examples 

Materials and Methods 

Cells. The following Epstein-Barr virus (EBV) transformed homozygous cell lines 

10 were used as sources of human HLA Class II molecules: LG2 [DRBlcOlOl (DR1)1; 
GM3107 [DRB50101 (DR2w2a)]; MAT (DRB10301 (DR3)1; PREISS [DRB10401 
(DR4w4)l; BIN40 [DRB10404 (DR4wl4)l; SWEIG [DRB11101 (DRSwll)]; PITOUT 
[DRB10701 (DR7)] (a); KT3 [DRB10405 (DR4wl5)]; Herluf [DRB11201 (DR5wl2)]; 
HO301 [DRB11302 (DR6wl9)]; OLL [DRB10802 (DR8w2)]; and HTC9074 [DRB10901 

15 (DR9), supplied as a kind gift by Dr. Paul Harris, Columbia University]. In some 

instances, transfected fibroblasts were used: L466.1 [DRB11501 (DR2w2b)]; TR81.19 
[DRB30101 (DR52a)]; and L257.6 [DRB40101 (DRw53)]. (Valli, et al 7. Clin. Invest. 
91:616 (1993). Cells were maintained in vitro by culture in RPMI 1640 medium 
supplemented with 2mM L-glutamine [GIBCO, Grand Island, NY], 50pM 2-ME, and 

20 10% heat-inactivated FCS [Irvine Scientific, Santa Ana, CA]. Cells were also 
supplemented with 100 ftg/ml of streptomycin and lOOU/ml of penicillin [Irvine 
Scientific]. Large quantities of cells were grown in spinner cultures. 

Cells were lysed at a concentration of 10 8 cells/ml in PBS containing 1 % 
NP-40 [Fluka Biochemika, Buchs, Switzerland], ImM PMSF [CalBioChem, La Jolla, 

25 CA], 5mM Na-orthovanadate, and 25mM iodoacetamide [Sigma Chemical, St. Louis, 

Mo]. The lysates were cleared of debris and nuclei by centrifugation at 10,000 x g for 20 
min. 

Affinity purification of HLA-DR molecules. Class II molecules were purified by 
30 affinity chromatography as previously described (Sette, et al. J. Immunol. 142:35 (1989) 
and Gorga, et al J. Biol. Chem. 262:16087 (1987)) using the mAb LB3.1 coupled to 
Sepharose 4B beads. Lysates were filtered through 0.8 and 0.4 juM filters and then passed 
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over the anti-DR column, which were then washed with 15 -column volumes of lOmM 
TRIS in 1 % NP-40, PBS and 2-column volumes of PBS containing 0.4% 
n-octylglucoside. Finally, the DR was eluted with 50mM diethylamine in 0.15M NaCl 
containing 0.4% n-octylglucoside, pH 11.5. A 1/25 volume of 2.0M Tris, pH 6.8, was 
5 added to the eluate to reduce the pH to -8.0, and then concentrated by centrifugation in 
Centriprep 30 concentrators at 2000 rpm (Amicon, Beverly, MA). 

Class II peptide-binding assays. A panel of 13 different specific DR-peptide assays were 
utilized in the present study. These assays were chosen as to be representative of the most 

10 common DR alleles. Table I lists for each DR antigen, the representative allelic product 
utilized, the cell line utilized as a source of DR, and the radiolabled probe utilized in the 
assay. Purified human Class II molecules [5 to 500 nM] were incubated with various 
unlabeled peptide inhibitors and 1-10 nM I25 I-radiolabeled probe peptides for 48h in PBS 
containing 5% DMSO in the presence of a protease inhibitor cocktail. The radiolabeled 

15 probes used were HA Y307-319 (DR1), Tetanus Toxoid [TT] 830-843 (DR2w2a, 

DRSwlll, DR7, DR8w2, DR8w3, DR9), MBP Y85-100 (DR2w2b), TT1272-1284 
(DR52a), MT 65 kD Y3-13 with Y7 substituted with F for DR3, a non-natural peptide 
with the sequence YARFQSQTTLKQKT (DR4w4, DR4wl5, DRw53) (Valli, et al 
supra), and for DR5wl2, a naturally processed peptide eluted from the cell line C1R, 

20 EALIHQLINPYVLS (DR5wl2) and 650.22 peptide, (TT 830-843 A S836 analog), for 
DR6wl9. 

Radiolabeled peptides were iodinated using the chloramine-T method. 
Peptide inhibitors were typically tested at concentrations ranging from 1201 /ig/ml to 1.2 
ng/ml. The data were then plotted and the dose yielding 50% inhibition (IC50) was 

25 measured. In appropriate stoichiometric conditions, the IC50 of an unlabeled test peptide 
to the purified DR is a reasonable approximation of the affinity of interaction (Kd). 
Peptides were tested in two to four completely independent experiments. The final 
concentrations of protease inhibitors were: ImM PMSF, 1.3nM 1.10 phenanthroline, 73 
jjlM pepstatin A, 8mM EDTA, and 200 fxM N alpha-p-tosyl-L-lysine chloromethyl ketone 

30 (TLCK) [All protease inhibitors from CalBioChem, La Jolla, CA]. Final detergent 

concentration in the incubation mixture was 0.05% Nonidet P-40. Assays were performed 
at pH 7.0 with the exception of DR3, which was performed at pH 4.5, and DRw53, which 
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was performed at pH 5.0. The pH was adjusted as previously described (Sette, et al J. 
Immunol 148:844 (1992)). 

Class II peptide complexes were separated from free peptide by gel 
filtration on TSK2000 columns (TosoHaas 16215, Montgomery ville, PA), and the fraction 
5 of bound peptide calculated as previously described (Sette, et aL , (1989) supra). In 
preliminary experiments, the DR prep was titered in the presence of fixed amounts of 
radiolabeled peptides to determine the concentration of Class II molecules necessary to 
bind 10-20% of the total radioactivity. All subsequent inhibition and direct binding assays 
were the performed using these Class II concentrations. 

10 

DRB1 specificity of DR4wl5, DR6wl9, DR8w2, DR8w3, and DR9 assays. 

Because the antibody used for purification is a-chain specific, pi molecules 
are not separated from 03 (and/or 04 and (35) molecules. Development and validation of 
assays in regard with DRp chain specificity has been described in detail elsewhere for 

15 many of the DR alleles listed above (108). Herein we describe for the first time DR4wl5, 
DR6wl9, DR8w2, DR8w3, and DR9 assays. Experiments addressing the p chain 
specificity of these new assays are described in the present section. 

DR4wl5. The 04 product DRw53 is co-expressed with DR4wl5 and the 
determination of the specificity of the DR4wl5 binding assay is complicated in that the 

20 same radiolabeled ligand is used for both the DR4wl5 and DRw53 binding assays. Since 
typically pi chains are expressed at 5-10 fold higher levels than other p chains, and all 
binding assays are performed utilizing limiting DR amounts, it would be predicted that the 
dominant specificity detected in the assay would be DR4wl5. To verify that this was 
indeed the case, the binding pattern of a panel of 58 different synthetic peptides in the 

25 putative DR4wl5 specific assay with that obtained in a DRw53 specific assay (which uses 
a DRw53 fibroblast as the source of Class II molecules). Two very distinct binding 
patterns were noted, and in several instances, a peptide bound to one DR molecule with 
high affinity, and did not bind to the other (data not shown). 

DR6wl9. The DR6wl9 assay utilizes as the source of Class II molecules 

30 the EBV transformed homozygous cell line H0301, which co-expresses DRB30301 

(DR52a). While the radiolabeled ligand used in the DR6wl9 assay is different than that 
used for the DR52a assay, the ligand is related (i.e., is a single substitution analog) to a 
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high affinity DR52a binder. As was done in the case of DR4wl5, the specificity of the 
assay was investigated by analyzing the binding capacity of a panel of naturally occurring 
peptides for DR6wl9 and DR52a. The two assays demonstrated completely different 
binding specificities. For example, in terms of relative binding, TT 1272-1284 binds 
5 63-fold better in the DR52a assay than in the DR6wl9 assay. Conversely, the Invariant 
chain peptide binds 189-fold better in the DR6wl9 assay. In conclusion, these data 
demonstrated that the binding of the radiolabeled peptide 650.22 to purified Class II MHC 
from the H0301 cell line is specific for DR6wl9. 

DR8w2 and DR8w3. The |3l specificity of the DR8w2 and DR8w3 assays 
10 is obvious in that no p3 (and/or B4 and p5) molecule is expressed. 

DR9. The specificity of DR9 assay is inferred from previous studies which 
have shown that the TT 830-843 radiolabeled probe peptide does not bind to DRw53 
molecules (Alexander, et aL, Immunity 1:751 (1994)). 

15 Results 

DR binding affinity of antigenic peptides recognized by DR restricted T cells 

To define a threshold DR binding affinity, to be considered as biologically 
significant, we compiled the affinities of a panel of 32 reported instances of DR restriction 
of a given T cell epitope. In approximately half of the cases, DR restriction was 

20 associated with affinities of less than 100 nM, and in the other half of the instances, with 
IC50% in the 100-1000 nM range. Only in 1 out of 32 cases (3.1%) DR restriction was 
associated with IC50% of 1000 nM or greater. It was noted that this distribution of 
affinities differs from what was previously reported for HLA class I epitopes, where a vast 
majority of epitopes bound with IC50% of 50 nM or less (Sette, et al , JI, 1994). This 

25 relatively lower affinity of class II restricted epitope interactions might explain why 

activation of class II restricted T cells in general requires more antigen relative to class I 
restricted T cells. 

In conclusion, this analysis suggested that 1000 nM may be defined as an 
affinity threshold associated with immunogenicity in the context of DR molecules, and for 
30 this reason a suitable target for our studies. 



PI and P6 anchors are necessary but not sufficient for DRB10401 binding 
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Several independent studies have pointed to a crucial role in DRB 10401 
binding of a large aromatic or hydrophobic residue in position 1 , near the N-terminus of 
the peptide and of a 9-residue core region (residues 1 through 9). In addition, an 
important role has been demonstrated for the residue in position six (P6) of this 9-residues 
5 core region. Short and/or hydrophobic residues were in general preferred in this position 
(O'Sullivan, etal, JI 147:2663, 1991; Sette, etal, JI 151:3163, 1993; Hammer, etaL, 
Cell 74:197, 1993 and Marshall, et al, JI 154:5927, 1995). 

In the present set of experiments, a library of 384 peptides was analyzed for 
DRB 10401 binding capacity and screened for the presence of the P1-P6 motif (that is, F, 

10 W, Y, L, I, V or M in PI and S, T, C, A, P, V, I, L or M in P6, at least 9 residues apart 
from the peptide C-terminus. This set of 384 peptides contained a total of 80 DR4w4 
binders (specifically 27 good binders [IC50 of 100 nM or less], and 53 intermediate 
binders [IC50 of the 100-1000 range]. Seventy-seven out of the 80 DR4w4 binders (96%) 
carried the P1-P6 motif. However, it should be noted that most non-DR4w4 binding 

15 peptides also contained the P1-P6 motif. Of 384 peptides included in our database, only 
125 were "P1-P6 negative." Only three of them (6%) bound appreciably to purified 
DR4w4 as opposed to 77/259 (30%) of the "P1-P6 positive' 1 peptides. Therefore, these 
results demonstrate that presence of suitable PI and P6 anchors are necessary but not 
sufficient for DRB10401 binding. 

20 

A detailed map of DRB10401 peptide interactions 

Next, for each P1-P6 aligned core region, in analogy with what the strategy 
previously utilized to detail peptide class I interactions the average binding affinity of 
peptides carrying a particular residue, relative to the remainder of the group, were 

25 calculated for each position. Following this method a table of average relative binding 
(ARB) values was compiled. This table also represents a map of the positive or negative 
effect of each of the 20 naturally occurring amino acids on DRB 10401 binding capacity 
when occupying a particular position, relative to the main P1-P6 anchors (Figure 1). 

Variations in ARB values greater than four fold (ARB > 4 or < 0.25) were 

30 arbitrarily considered significant and indicative of secondary effects of a given residue on 
DR-peptide interactions. Most secondary effects were associated with positions 4, 7, and 
9. These positions correspond to secondary anchors engaging shallow pockets on the DR 
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molecule. In addition, significant secondary effects were detected for M in position 3 
(ARB = 12.8) T in position 3 (ARB = 4.34) and I in position 5 (ARB = 4.4). 

Development of a DRB10401 specific algorithm 

5 Next, the ARB table was utilized to develop a DRB 10401 specific 

algorithm. In order to predict 0401 binding propensity, each aligned P1-P6 sequence was 
scored by multiplying, for each position, the ARB value of the appropriate amino acid. 
According to this procedure, a numerical "algorithm score" was derived. If multiple 
P1-P6 alignments were possible, binding scores were calculated for each one and the best 

10 score was selected. The efficacy of this method in predicting 0401 binding capacity is 
shown in Table Ha. 

Considering only peptides with algorithm scores above -17.00 narrowed the 
set of predicted peptides to 156. This set still contained 72 out of 80 (90%) of the total 
high or intermediate DR binders. Raising the cut-off to an algorithm score of -16.44 or 

15 higher still allowed identification of 60 out of 80 (75%) of the DR4w4 binding peptides. 
Of the whole 107 peptide set, twenty-five of them were either good or intermediate 
binders. In other words, as expected, increasing the algorithm score stringency predicted 
a smaller fraction of the total binders present in the set, but at the same time less false 
positive peptides were identified. 

20 

Blind test of the predictive power of the DRB10401 specific algorithm 

To verify that the predictive capacity of our algorithm was not merely a 
reflection of having utilized the same data set to test and define the algorithm itself, we 
further examined its efficacy in a blind prediction test. For this scope we utilized data 

25 from an independent set of 50 peptides, whose binding affinities were known, but that had 
not been utilized in the derivation of the algorithm. As shown in Table lib, the algorithm 
was effective in predicting DR4w4 binding capacity of this independent peptide set. The 
algorithm score of -17.00 identified a total 18 peptides. This set contained 3/3 (100%) of 
all good binders, and 8/11 (70%) of all intermediate binders in the entire test set of 50 

30 peptides. Increasing the cut-off value to -16.44, identified a set of nine peptides. Seven 
of them (78%) were either good or intermediate binders. This set contained 7 out of 14 
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(50%) of the binders contained in the blind prediction peptide set. In conclusion, these 
data supports the validity of the DR4w4 specific algorithm described above. 

Detailed maps of DRB10401, DRB10101, and DRB10701 peptide binding specificities 

5 Next, we analyzed the binding to purified DR1 and DR7 molecules for the 

same set of 384 peptides utilized to define the DR4w4 algorithm. It was found that this set 
contained 120 and 59 binders for the DR1 and DR7 alleles, respectively. A total of 158 
peptides were capable of binding either DR1, DR4w4 or DR7. A large fraction of them 
(73/158; 46%) were also degenerate binders, which bound two or more of the three alleles 
10 thus far considered. Furthermore, we also found that more than 90% of the DR1 or DR7 
good and intermediate binders carried the P1-P6 motif. Most importantly, 72 out of 73 
(99%) degenerate DR binders carried this motif (data not shown). In conclusion, this 
analysis suggests that P1-P6 based algorithms might be utilized to effectively predict 
degenerate DR binders. 

15 In analogy with what was described above for DR4w4 molecules, specific 

algorithms were designed for the DR1 and DR7 alleles. Figures 2 A and 2B detail the 
allele specific maps defined according to this method. 

As in the case of DRB 10401, most secondary effects were concentrated in 
positions 4, 7 and 9. Position 4 was especially prominent in the case of DR1, while 

20 position 7 was the most prominent secondary anchor for DR7. Specific algorithms were 
developed based on these maps, and it was found that the cut-off values necessary to 
predict 75% or 90% of the binders were -19.32 and -20.28 for DR1, and 20.91 and 
-21.63 for DR7, respectively. Depending on the particular allele or cut off value selected, 
40 to 60% of the predicted peptides were in fact good or intermediate binders (data not 

25 shown). 

Development of a DR1-4-7 combined algorithm 

Finally, we examined whether a combined algorithm would allow to predict 
degenerate binders. For this purpose, the sequences of the 384 peptides in our database 
30 were simultaneously screened with the three (DR1, 4w4, and 7) specific algorithms. It 

was found that an even 100 peptides were predicted (using the 75% cut off) to bind either 
two or three of the alleles considered. This set contained 59 out of 73 (81 %) of the 



WO 98/32456 ^ PCT/US98/01373 

peptides which were in fact capable of degenerate 1-4-7 binding (defined as the capacity to 
bind to more than one of the DR1, 4w4 or 7 alleles) (Table III). 



Definition of a target set of DR specificities, representative of the world population 

5 The data presented in the preceding sections illustrates how peptides capable 

of binding multiple DR alleles can be identified by the use of a combined " 1-4-7" - 
algorithm. Next, we wished to examine whether the peptides exhibiting degenerate 1-4-7 
binding behavior would also bind other common DR types as well. As a first step in our 
experimental strategy, we sought to define a set of target DR types representative of a 

10 large (> 80%) fraction of the world population, irrespective of the ethnic population of 

origin. For this purpose, seven additional DR antigens were considered. For each one of 
the DR antigens considered in this study, (including DR1, 4 and 7), the estimated 
frequency in various ethnicities, according to the most recent HLA workshop (11th, 1991) 
is shown in Table IVa, together with the main subtypes thus far identified. 

15 For the purpose of measuring peptide binding affinity to the various DR 

molecules, one representative subtype for each DR antigen was chosen (Table I). It 
should be noted that for most antigens, either one subtype is by far the most abundant, or 
alternatively a significant degree of similarity in the binding pattern displayed by the 
different, most abundant subtypes of each DR antigen is likely to exist (see comments 

20 column of Table IVb) . One exception to this general trend is represented by the DR4 

antigen, for which significant differences in peptide specificity between the 0401 and 0405 
have been reported. Since both alleles are quite frequent (in Caucasians and Orientals, 
respectively) we included both DR 0401 and 0405 in the set of representative DR binding 
assays. 

25 Our set of representative assays is mostly focused on allelic products of the 

gene, because these molecules appear to be the most abundantly expressed, serve as the 
dominant restricting element of most human class III responses analyzed thus far, and 
accurate methods for serologic and DNA typing most readily available. However, we 
have also considered in our analysis assays representative of DRB3/4/5 molecules (Table 

30 IVc) . These molecules serve as a functional restriction element, and their peptide binding 
specificity has been previously shown to have certain similarities to the specificity of 
several common DRP 2 allelic products. 
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A general strategy for prediction of DR-degenerate binders. 

To test whether the 1-4-7 combined algorithm would also predict degenerate 
binding to other common DR types, we measured the capacity of three different groups of 
synthetic peptides to bind the panel of purified HLA DR molecules. The three different 
5 peptide sets were: A) 36 peptides which did not score positive in the combined 1-4-7 
algorithm (non-predictions), B) 36 peptides which did score positive for the 1-4-7 
algorithm, at the 75% cut off level, but had been found upon actual testing not to be 
degenerate 1-4-7 binders ("wrong" predictions), and C) 29 peptides which scored positive 
in the 1-4-7 algorithm, and also proved upon experimental testing, to be actual 1-4-7 
10 degenerate binders (correct predictions). The results of this analysis are shown in Table 
V. 

Within the set of " non-predictions" peptides (Table Va) only 3 out of 34 
(9%) bound at least two of the DR1, 4w4 or 7 molecules. Interestingly, 2 (1136.04 and 
1136.29) out of 3 of these peptides were also rather crossreactive, and bound additional 

15 DR types (DR2w2 02, DR4wl5, 5wll and 8w2 in the case of 1136.04, and 2w2 p2, 

4wl5, 9 and 5wl2 in the case of 1136.29). Peptides from the "wrong predictions" peptide 
set (Table V5), by definition bound at the most only one of the DR1, 4w4 or DR7 
molecules, and were also poorly degenerate or other DR types with 
only two peptides (1136.22 and 1188.35) binding a total of three DR molecules. Within 

20 this peptide set, no peptide bound four or more of the DR molecules tested (data not 
shown). 

These results are contrasted by data obtained with the peptide set 
corresponding to peptides which were first predicted by the use of the combined 1, 4, 7 
algorithm, and then experimentally found to be degenerate DR 1-4-7 binding. Fourteen out 
25 of 29 peptides tested (48%) bound a total of five or more alleles. Four of them were 
remarkably degenerate (1188. 16, 1188.32, 1188.34 and F107.09) and bound a total of 
nine out of the 11 DR molecules tested. In conclusion, these results suggest that a strategy 
based on the sequential use of a combined DR1, 4, 7 algorithm and quantitative DR1, 4, 7 
binding assays can be utilized to identify broadly crossreactive DR binding peptides. 



30 
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Definition of the HLA-DR 1-4-7 supertype 

The data presented above also suggested that several common DR types are 
characterized by largely overlapping peptide binding repertoires. When this issue was 
analyzed in more detail, by analyzing the binding pattern of the thirty-two peptides from 
5 Table Va and b which were actual DR 1-4-7 degenerate binders. Thirty-one of them (97%) 
bound DR1, 22 (69%) DR4w4 and 21 (66%) DR7. These files are contrasted with the 
low percentages of binding observed amongst the remainder non-degenerate binding 
peptides (17/67 (25%), 8/67 (12%) and 7/67 (10%), for DR1, 4w4 and 7, respectively) 
(Table VII). 

10 Interestingly, a large fraction of the 1-4-7 degenerate binders also bound 

certain other common DR types. Sixteen (50%) bound DR2w2a, 18 (56%) DR6wl9, 18 
(56%) DR2w2b and 20 (62%) DR9. In all cases, the frequency of binding in the non-1-4- 
7 degenerate peptide set was much lower (Table VIII). 

Significant, albeit lower, frequencies of cross reactivity were noted also for 

15 DR4wl5, DR5wll, and DR8w2 (in the 28 to 37% range). Finally, negligible levels of 
cross reactivity were observed in the case of DR3 and 5wl2 and DR53. Further studies 
will address whether either of these two group of molecules (DR4wl5, 5wll, and 8w2 on 
one hand, and DR3, DR53 and 5wl2 on the other) might belong to different DR 
supertypes. 

20 In conclusion, these data demonstrates that a large set of DR molecules 

encompassing DR1, 4w4, 2w2a, 2w2b, 7, 9 and 6wl9 is characterized by largely 
overlapping peptide binding repertoires. 

Discussion 

25 In the present report we have analyzed the peptide binding specificity of a 

set of 13 different DR molecules, representative of DR types common among the 
worldwide population. Detailed maps of secondary anchors and secondary interactions 
have been derived for three of them (DR4w4, DR1 and DR7). Furthermore, we 
demonstrated that a set of at least seven different DR types share overlapping peptide 

30 binding repertoires; and consequently that broadly degenerate HLA DR binding peptides 
are a relatively common occurrence. This study also describes computerized procedures 
which should greatly assist in the task of identification of such degenerate peptides. 
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We would like to discuss the data in the context of our current 
understanding of peptide-class II interactions, as well as in the context of the recently 
described class I supermotifs. Finally, the potential implications of broadly degenerate 
class II epitopes for epitope based vaccine design should also be considered. 
5 Firstly, our studies illustrate how the vast majority of the peptides binding 

with good affinity to DR4w4, DR1, DR7 and most of the other DR types analyzed in the 
current study (data not shown), are all characterized by a P1-P6 motif consistent with the 
one originally proposed by O 1 Sullivan, et aL Crystallographic analysis of DRl-peptide 
complexes revealed that the residues occupying these positions engage two complementary 

10 pockets on the DR1 molecule, with the PI position corresponding to the most crucial 
anchor residue and the deepest hydrophobic pocket. Our analysis also illustrates how 
other "secondary anchor" positions drastically influence in an allele-specific manner 
peptide binding capacity. Position 4 was found to be particularly crucial for DR1 binding, 
position 9 for DR4w4, and position 7 for DR7. These data are consistent with previous 

15 results which originally described such allele-specific anchors, and with crystallographic 
data which illustrates how these residues engage shallow pockets on the DR molecule. 

Secondly, our studies illustrate how an approach based on alignment and 
calculation of average relative binding values of large peptide libraries allows definition of 
quantitative algorithms to predict binding capacity. The present study extends those 

20 observations to two other common HLA-DR types, and also illustrates how the combined 
use of the 1-4-7 algorithms can be of aid in identifying broadly degenerate DR binding 
peptides. 

The data presented herein suggest that a group of common DR alleles, 
including at least DR1, DR2w2a, DR2w2b, DR4w4, DR6wl9, DR7 and DR9 share a 

25 largely overlapping peptide repertoire. Degenerate peptide binding to multiple DR alleles, 
and recognition of the same epitope in the context of multiple DR types was originally 
described by Lanzavechia, Sinigallia's and Rothbard's groups. The present study provides 
a classification of alleles belonging to a main HLA-DR supertype (DRl-4-7-like) which 
includes DR1, DR2w2a, DR2w2b, DR4w4, DR7, DR9, DR6wl9. On the basis of the 

30 data presented herein, at least two additional groups of alleles exist. The first group 

encodes for molecules with significant, albeit much reduced overlap with the 1-4-7-like 
supertype (DR4wl5, 8w2, 5wll). The second group of alleles (5wl2, 3wl7, and w53) 
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clearly has little repertoire association with the 1-4-7 supertype. In this context it is 
interesting to note that Hammer, et al. noted that good DRSwll binding peptides are 
frequently characterized by positively charged P6 anchor (which would be poorly 
compatible) with the herein proposed 1-4-7 supermotif. It is also interesting to note that 
5 Sidney, et al proposed that DR3wl7 binds a set of peptides largely distinct from those 

bound by other common DR types. Future studies will have to determine whether any of 
the molecules listed above can be grouped in additional DR supertypes. Our group is 
currently investigating whether analysis of polymorphic residues lining the peptide binding 
pockets of DR can be utilized to aid in the classification and prediction of HLA DR 
10 supertypes. 

We would like to comment on similarities and differences between the HLA 
DR supertype described herein and the recently described HLA class I supermotifs. Class 
I supermotifs are clear-cut and, as a rule, non-overlapping. Four of them have been 
described all approximately equally frequent amongst the worldwide population. By 

15 contrast, the repertoire defining the HLA DR supertype herein described is not clear-cut 
and overlaps, at least in part, with the repertoire of other alleles. It also appears that on 
the basis of the data presented in Tables I and IV, even if other DR supertypes exist, the 
DR1-4-7 is going to be by far the most abundantly represented worldwide. 

Finally, we would like to point out the possible relevance of these data in 

20 terms of development of epitope based vaccines. Class II restricted HTL have been 

implicated in protection from, and termination of many important diseases. Inclusion of 
well defined class II epitopes in prophylactic or therapeutic vaccines may allow to focus 
the immune response towards conserved or subdominant epitopes, and avoid suppressive 
determinants. Based on the data presented herein (Table IV), the DR1-4-7 supertype 

25 would allow coverage in the 50 to 80% range, depending on the ethnicities considered. It 
is thus possible that broad and not ethnically biased population coverage could be achieved 
by considering a very limited number of peptide binding specificities. 

Based on the results present above, the sequences of various antigens of 
interest were scanned for the presence of the DR 1-4-7 motifs. Peptides identified using 

30 this approach are broadly cross reactive, class II restricted T cell epitopes. Table VIII 
presents a listing of such peptides derived from HBV, HCV, HIV and Plasmodium 
falciparum (Pf). A total of 146 peptides were identified: 35 from DHBV, 16 from HCV, 
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27 

50 from HIV, and 45 from Pf. Standard conservancy criteria were employed in applicable 
cases. 

The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill 
5 in the art. All publications, patents, and patent applications cited herein are hereby 
incorporated by reference for all purposes. 
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Table II 



An algorithm to predict DRB1*0401 binding capacity, 
a) Original peptide set 



Selection 
Criteria 



No. of peptides (Binding nM) 



High 
£100 



Inter. 
100-1000 



Non 
>1000 



Total 



None 
P1-P6 
-17.00 " 
-16.44 a 



27 
27 
27 
25 



53 
50 
45 
35 



304 
182 
84 
47 



384 
259 
156 
107 



1) 
2) 



Algorithm score which predicts 90% of all binders. 
Algorithm score which predicts 75% of all binders. 
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Table II 



b) Blind test of the predictive power of the DRB1*0401 algorithm. 



Selection 
Criteria 



No. of peptides (Binding nM) 
High Inter. Non 

<noo 100-1000 >iboo 



Total 



None 
P1-P6 
-17.00 
-16.44 



3 
3 
3 
3 



11 
9 
8 
4 



36 
28 
7 
2 



50 
40 
18 
9 
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Table III 

A combined "1-4-7" algorithm. 



Selection 
Criteria 



Degenerate 
Binders " 



Percent of Total 
Degenerate Binders 



None 
P1-P6 



Combined Algorithms 
(90% Cutoff Value) 



73/384 
72/259 

67/147 



100% 
99% 

92% 



Combined Algorithms 
(75% Cutoff Value) 



59/100 



81% 



1) Degenerate binders are defined as peptides binding at least two out of the three 
DR1, 4w4, and 7 molecules with an IC50 of 1 uM or less. 
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Table IV 

Fhenotypic frequencies of 10 prevalent HLA-DR antigens 



Anrigen 


Alleles 


Cauc. 


Phenotypic Frequencies 
Blk. Jpn. Chn. Hisp. 




DR1 


DRBl*0101-03 


18.5 


8.4 


10.7 


45 


10.1 


10.4 


DR2 


DRBl»1501-03 


19.9 


14.8 


30.9 


22.0 


15.0 


20.5 


DR3 


DRB1-0301-2 


17.7 


19.5 


0.4 


7.3 


14.4 


11.9 


UK4 


DRB1»0401-12 


23.6 


6.1 


40.4 


21.9 


29.8 


24.4 


DR7 


DRB1*0701-02 


26.2 


11.1 


1.0 


15.0 


16.6 


14.0 


DR8 


DRBl»0801-5 


5.5 


10.9 


25.0 


10.7 


23.3 


15.1 


DR9 


DRB1»09011,09012 


3.6 


4.7 


245 


19.9 


6.7 


11.9 


DR11 


DRB1M101-05 


17.0 


18.0 


4.9 


19.4 


18.1 


15.5 


DR12 


DRBl*1201-02 


2.8 


5.5 


13.1 


17.6 


5.7 


8.9 


DR13 


DRB1M301-06 


21.7 


165 


14.6 


12.2 


10.5 


15.1 


Total 




97.0 


83.9 


98.8 


95.5 


95.6 


94.7 
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Table VII 



Frequency of Binders 



DRType 


1-4-7 
Degenerate 
Binders (%) 


Non 1-4-7 
Degenerate 
Binders (%) 


1 

4w4 
7 


31/32 (97) 
22/32 (69) 
21/32 (66) 


17/67 (25) 
8/67 (12) 
7/67 (10) 


9 

6wl9 
2w2Gb 
2w2fia 


20/32 (62) 
18/32 (56) 
18/32 (56) 
16/32 (50) 


2/67 (3.0) 
6/67 (8.9) 
16/67 (24) 
10/67 (15) 


4wl5 
8w2 
5w11 


12/32 (37) 
10/32 (31) 
9/32 (28) 


4/67 (6.0) 
3/67 (45) 
6/67 (8.9) 


5wl2 
3wl7 
w53 


3/32 (9.4) 
1/32 (3.1) 
2/16 (13) 


4/67 (6.0) 
0/67 (0) 
7/43 (16) 
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Table VIII 



Sequence 


Source 


lit 

roa 


Conservancy 


Predicted 
1-4-7 


IGPFMKAVCVEVEKT 


Pf TRAP 


227 


100 


a 

o 


ILSVFFLALFFHFN 


Pf EXP1 


3 




"l 

«j 


K5KYKLAT5VLAGLL 


Pf EXP1 


71 




-2 


KYKLAT5VLAGLLGN 


Pf EXP1 


73 




•3 


LCNVKYLVTvTLIFF 


Pf TRAP 


4 


100 


O 


I^VFFLALFHEFNK 


Pf EXP1 


4 




a 

j 


LVNLUFHINGKnK 


Pf LSA1 


13 




a 
j 


MKILSVFFLALFHI 


Pf EXP1 1 


1 




a 


MRKLAILSVSSFLFV 


Pf CSP 


2 


95 


3 


NSSICL1MVLSFLFL 


Pf CSP 


417 


95 


3 


NVKYLVIVFLIFFDL 


Pf TRAP 


6 


100 


3 


SFYFTLVNLUFHIN 


Pf LSA1 


8 




3 


V FFLA LFFIIFNKES 


Pf EXP1 


6 




3 


YFILVNLLIFHINGK 


Pf LSA1 


10 




3 


YISFYFILVNLLIFH 


Pf LSA1 


6 




3 


ACLLCNVSTVLLCGV 


Pf EXP1 


82 




2 


ANQLWILTDGJPD5 


Pf TRAP 


153 


100 


2 


AYKFVVPCAATPYAG 


Pf TRAP 


514 


80 


2 


DKELTMSNVKNVSQT 


Pf LSA1 


81 




2 


FNWNSSIGUMVLS 


Pf CSP 


413 


100 


2 


FYFILV NLLIFHI NG 


Pf LSA1 


9 




2 


GLAYKFWPGAATPY 


Pf TRAP 


512 


80 


2 


GRDVQNNrVDEIKYR 


Pf TRAP 


25 


90 


2 


HJLYISFYFILVNLL 


Pf LSA1 


3 




2 


HNWVNHAVPLAMJCLI 


Pf TRAP 


'62 


80 


2 


IVFUFFDLPLVNGR 


Pf TRAP 


12 


100 


2 


KFWPGAATPYACEP 


Pf TRAP 


516 


80 


2 


KSLLRNLGVSENIFL 


Pf LSA1 


98 




2 


KYLVTVFLUhFDLFL 


Pf TRAP 


8 


100 


2 


LAGLLGNV51VLLGG 


Pf EXP1 


81 




2 


LGNVSTVLLGGVGLV 


Pf EXP1 


85 




2 


T TFFDT FT VMH?nvn 


Pf TRAP 


15 


100 


2 


LWILTDGIPDSIQD 


Pf TRAP 


156 


100 


2 


QLWTLTDGIPDSIQ 


Pf TRAP 


155 


100 


2 


RGYYIPHQ5SLPQDN 


Pf LSA1 


1669 




2 


RHNWVNHAVPLAMKL 


Pf TRAP 


61 


80 


2 


RHPFKIGSSDPADNA 


Pf EXP1 


107 




2 


SSVFNWNS5IGLIM 


Pf CSP 


410 


95 


2 


VFNWNSSIGUMVL 


Pf CSP 


412 


95 


2 


VKNVIGPFMKAVCVE 


Pf TRAP 


223 


100 


2 


VKYLVIVFLIFFDLF 


Pf TRAP 


7 


100 


2 


VSTVLLGCVCLVLYN 


Pf EXP1 


88 




2 


WENVKNVIGPFMKAV 


Pf TRAP 


220 


100 


2 


YKFWPGAATPYAGE 


Pf TRAP 


515 


80 


2 
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Table VIII 



PCT/US98/01373 





sequence S 


iource 


1st 
Pos 


Conservancy 


Predicted 
1-4-7 


I 


^RWQVMIVWQVDRM I 


-flVl VIF 


2 


81 


3 


I 


ZRYLKDQQLLGTWGCS I 


HV1 ENV 


589 




3 


] 


ESELVSQUEQLIKK I 


-nvi pol 


6% 


80 


3 


] 


^RKYTAFTIPSINNE I 


-nvi pol 


303 


93 


3 


< 


SQM VHQ AISPRTLN A I 


-HV1 GAG 


172 


88 


3 


] 


PEYVEFVNTPPLVKL 1 


■HV1 POL 


593 


93 


3 


] 


LPPWAKHVASCDK 1 


4F/1 POL 


770 


87 


3 


] 


NJRE1LKEPVHGVYYD 3 


tfTVl POL 


4*5 


87 


3 


] 


PAIFQSSMTKILEPF 1 


t-flVl POL 


336 


80 


3 


1 


PPVV AKEIVASCDKC 1 


hHVl POL 


771 


87 


3 




QEQIGWMTN NPPIPV J 


H0TV1 GAG 


276 


81 


3 




QGQMVHQAISPRTLN 


HIV1 GAG 


171 


85 


3 




5PAIFQSSMTKILEP 


HTV1 POL 


335 


80 


3 




TLNFPISPICTVPVK 


HTV1 POL 


176 


100 


3 




VKNWMTETLLVQNAN 


HTV1 GAG 


348 


81 


3 




VPVWKEATTTLFCAS 


HTV1 ENV 


54 


81 


3 




WEFVNTPPLVKLWYQ 


HTV1 POL 


596 


93 


3 




WVKWEEKAJF5PEVI 


HIV GAG 


187 


33 


3 




YYGVPVWKEATTTLF 


HtVl ENV 


51 


83 


3 


ASDFNLPPWAKETV 


HP/1 POL 


765 


80 


2 


ASGYIEAEVIPAETG 


HIV1 POL 


822 


93 


2 


DFNLPPVVAKEIVAS 


HP/1 POL 


767 


87 


2 


EAELRILQQLLFIHF 


HP/1 VPR 


58 


82 


2 


EKVYLAWVPAHKGIG 


HIV1 POL 


711 


93 


2 


ETAYFLLKLAGRVVPV 


HIV POL 


838 


65 


2 


EVQLCIPHPACLKKK 


HtVl POL 


268 


80 


2 


FWEVQLGIPHPAGLK 


HT/1 POL 


266 


100 


2 


GCTLNFP1SPIETVP 


HP/1 POL 


174 


100 


2 


GHYKRWIILGLNKI 


HIV1 GAG 


294 


85 


2 


GTVLVGPTPVNIIGR 


HIV1 POL 


153 


100 


2 


HXA1GTVLVGP1PVN 


HTV1 POL 


149 


93 


2 


IGTVLVGFTPYNIIG 


HIV POL 


152 


74 


2 


KRW1ILGLNKIVRMY 


HIV1 GAG 


298 


88 


2 


KV YLAWVP AHKGIGG 


HIV POL 


712 


74 


2 


LICTTAVPYVNASWSNK 


HIV1 ENV 


607 




2 


LLQLTVWGIKQLQAR 


HP/1 ENV 


731 


80 


2 


NFPISPIETVPVKLK 


HIV1 POL 


178 


100 


2 


PQGWKGSPA1FQSSM 


HP/1 POL 


329 


87 


2 


PVNOCRNLLTQIGC 


HT/1 POL 


161 


87 


2 


QHLLQLTVWGIKQLQ 


HP/1 ENV 


729 


80 


2 


QQHLLQLTVWGIKQL 


HIV1 ENV 


728 


80 


2 


SPEVTPMFSALSEGA 


HT/1 GAG 


197 


88 


2 


TKELQKQITKIQNFR 


HIV POL 


952 


67 


2 


TVLVGPTPVNI1GRN 


HP/1 POL 


154 


100 


2 


VEAIIRILQQLLFIH 


HP/1 VPR 


57 


82 


2 


VIPMFSALSEGATPQ 


HP/1 GAG 


200 


88 


2 


VNIIGRNLLTQIGCT 


HP/1 POL 


162 


87 


2 


WCCSGKL1CTTAVPWN 


HP/1 ENV 


601 




2 


WIILGLNKT/RMYSP 


HTV1 GAG 


300 


88 


2 


YKRWULGLNKIVRM 


HP/1 GAG 


297 


88 


2 


FILVNLLIFHINGKI 


Pf LSA1 


n ! 


3 
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Table VIII 



Sequence 


Source \ 


1st 
Pos 


Conservancy 


Predicted 
1-4-7 


AEDLNLGNLNVSIPYV 


HBV POL 


JO 




3 


DLNLGNLNVSIPWTH 


HBV POT 


ah 

•Hi 


oc 


3 


GhhLLlHILTIPQSL 


HBV ENV 

A 1L> \ GIN V 


1fi1 


on 

ou 


3 


EFLFILLLCUFLLV 


HBV ENV 




on 
oU 


3 


NLNVSIPWTHKVGNF 


HBV POL . 




oc 


3 


PFLLAQFT5AICSW 


HBV POL 




ac 


3 


RF5WLSLLVPFVQWF 


HBV ENV 




IUU 


3 


SFFLLAQFTSAICSV 


HBV POL 




oc 


*> 
3 


5VRF5WLSLLVPFVQ 


HBV ENV 


lift 


on 
OU 


3 


APSYMDDWLGAKSV 


HBV POL 


546 


on 


z 


AGFFLLTRILUPQS 


HBV ENV 


180 


fin 
ou 


Z 


FVQWFVGLSPTVWLS 


HBV ENV 


342 


oc 


2 


GAHLSLRCLPVCAFS 


HBV X 


50 


on 


Z 


GT5FVYVPSALNPAD 


HBV POL 


774 


fin 


Z 


GVWERTPP A YRPPN A 


HBV NUC 


123 


OC 


Z 


HLSLRCLPVCAFSSA 


HBV X 


52 


on 


z 


riFLFTLLLCUFLL 


HBV ENV 


244 


fin 

Ou 


z 


ILLLCUFLLVLLDY 


HBV ENV 


249 


95 


Z 


rVCLLGFAAPJKl'QCC 


HBV POL 


636 


90 


X. 


KF A VP NLQSLTNLLS 


HBV POL 


406 


95 




LAQFTSA ICSWRRA 


HBV POL 


526 


95 


mm 


LCUFLLVLLDYQGM 


HBV ENV 


252 


95 


2 


LCQVFADATKI C WGL 


HBV POL 


694 


95 


2 


LHLYSHPIILGFRKI 


HBV POL 


501 


80 


2 


LL CLIFLL VLLD YQG 


HBV ENV 


251 


95 


2 


LVLLDYQGMLPVCPL 


HBV ENV 


258 


90 


2 


LVPFV QWFVCLSP IV 


HBV ENV 


'339 


95 


2 


PLPIHT AELLA ACFA 


HBV POL 


722 


80 


2 


QCGYPALMPLYAOQ 


HBV POL 


648 


95 


2 


RDLLDTASALYREAL 


HBV NUC 


28 


80 


2 


SFGVWIRTPPAYRPP 


HBV NUC 


121 


90 


2 


SWL5RKYTSFPWLL 


HBV POL 


750 


85 


2 


VGLLGFAAPFTQCGY 


HBV POL 


637 


95 


2 


VPN LQSLTN LLSSNL 


HBV POL 


409 


85 


2 


WPKFAVPNLQSLTNL 


HBV POL 


404 


95 


2 


KVLVLNPSV A ATLGF 


HCV 


1255 


100 


3 


PTLWARMILMTHFF5 


HCV 


2870 


79 


3 


ADLMGYIPLVGAPLG 


HCV 


131 


79 


2 


AVQWMNRUAFASRG 


HCV 


1917 


100 


2 


DLEUT5CS5NVSVA 


HCV 


2812 


93 


2 


DLYLVTRHADVIPVR 


HCV 


1134 


79 


2 


EDLVNLLPAILSPGA 


HCV 


1882 


79 


2 


FTTLPALSTGUHLH 


HCV 


684 


79 


2 


GARLWLATATPPGS 


HCV 


1345 


79 


2 


GIQYLAGLSTLPGNP 


HCV 


1776 


100 


2 


CVNYATCNLPGCSF5 


HCV 


161 


79 


2 


IQYLAGLSTLPGNPA 


HCV 


1777 


100 


2 


LH GLS AF5LHSYSPG 


HCV 


2919 


79 


2 


VNLLPAILSPGALW 


HCV 


1885 


79 


2 


VQWMNRLIAFASRGN 


HCV 


1918 


100 


2 


1 YKVLVLNPSV AATLG 


HCV 


1254 


100 


2 
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Class II Peptides 



Peptide 


A A 


Sequence 


Source 


00 8.00 


1 6 


SAt i SSDITASVMCAK 


HEL 81-06 


200*06 


1 6 




HIV gp25 41-56 


d- 1 J.i u 


1 6 


NKALELFRKDIAAKYK 


Sp. W. myo. 132-147 


506.01 


20 


NKALE-LFRKDIAAKYKELGY 


SW Myo 132-151 




1 8 


ALELFRKDIAAKYKELGY 


Sp. W myo. 134-151 


506,0b 


1 6 


ELFRKD1AAKYKELGY 


Sd W mvo. 136-151 


570.01 


1 6 


MAKTI AYDEE ARP^G LE 


Hoa| Shock Prot 


705.06 




KVYI PRMKMPFKYNLTSVLM 


Ova 270-298 


71 7.04 


1 4 


YASFVKTTTLRKFT-NH2 


corrtbinBtorta!: DR2 ooiimized 


857.02 


9 n 


PHH T ALRO A! LCWQ PL MTLA 


HBV corn 50-69 


865.01 


1 5 




OVA KM core extension 


r050.03 


20 


GFYTTGAVROI FGDYKTTIC 


PLP 91-110 


rOoa.Di 




ON! LLSN APLGPQFP 


Tvrotinn^A 66-70 

1 TIWOIIiaoB WW W w 


rnnn m 
rOyo.UJ 


20 


A AYA AOGY KVLVLNPS VAAT 


HCV NS3 1242-1261 


F098.04 


9 o 


GYKVLVLNPSVAATLGFGAY 


HCV NS3 1248-1267 


r- OBB.Ob 


1 4 


GYKVLVLNPSVAAT 


HCV NS3 1248-1261 


F098.0B 


1 9 


SYV^^TNMGU<FRQLLWFHI 


HBV Care 87-105 


h09B .1 O 


1 2 




HBV Core 94-105 


F1 34,04 


? n 


TLH GFTP1_LY RLG A VONE-lT 


HCV NS4 1-20 


C -1 1 A f\ K 

r 1 


2 0 


NFISGIQYLAGLSTLPQNPA 


HCV NS4 151-170 


h 1 34. Uo 


2 1 


GEGAVOWMNR U AFASRGNHV 


HCV NS4 293-313 (1914-1934) 


1 A«pD 


1 7 


KPVSOMRMATPLLMRPM 


Mouso invariant chain 65-101 


Tf 5fl ml 


24 


LPKPPKPVSKMRMATPLLMOALPM 


Human invariant chain 80-103 


77 097Q 


1 5 


EYLVSFGVW1RTPPA 


HBV NUC 117 


27.0280 


1 5 


GVWIRTPPAYRPPNA 


HBV NUC 123 


27.028 1 


1 5 


RHYLHTLWKAG1LYK 


HBV POL 145 


27.0283 


1 5 


VPNLQStTNLLSSNL 


HBV POL 409 


27.0288 


1 S 


WVTVYYGVFVWKEAT 


HIV1 ENV 47 


27.0293 


1 s 


YYGVPVWKEaTTTLF 


HIV1 ENV 51 


27.0294 


1 5 


VPVWKEATTTLFCAS 


HIV1 ENV 54 


27.0295 


1 5 


USGIVOOQNNU-RAI 


HIVi ENV 711 


27.0296 


1 S 


QQHLJLQLTWGIKQL 


HIV1 ENV 728 


27.0297 


1 5 


QHLLQLTWVd KQLD 


HIV1 ENV 729 


27 .029B 


1 5 


LLQLTVWG1KQLQAR 


H1V1 ENV 731 


27 .0304 


1 5 


QGQMVHQAJSPRTLN 


HIV1 GAG 171 


c. i .UJU / 


1 5 


SPEVIPMFSALSEGA 


HIV1 GAG 197 


^7 n*7 1 n 


1 5 


QEQJGWMTNNPPIPV 


HIVI GAG 276 


97 n*Ji 1 


1 5 


GEIYKRW11LGLNK1 


HIV1 GAG 294 


97 mi 9 


1 5 


YKRWI ILGLNKIVRM 


H1V1 GAG 297 


27 03 1 3 


1 S 


KR WIILGLNKIVR MY 


HIVI GAG 298 


27 0314 


1 5 


WULGLNKIVRMYSP 


HIVi GAG 300 




1 5 


VKNVVMTETLLVaNAN 


HIV1 GAG 348 


97 m99 


1 5 


GTVLVGPTP VNI IGR 


HIV1 POL 153 


27.0324 


1 5 


PVNI1GRNLJLTQ1GC 


HIV1 POL 161 


27.0326 


1 5 


GRNLLTOIGCTLNFP 


HIV1 POL 166 


27.0328 


1 5 


TLNFPISPIETVPVK 


HIV1 POL 176 


27.0329 


1 5 


NFPISPIETVPVKLK 


HIV1 POL 178 


27.0341 


1 5 


FRKYTAFTIPSINNE 


HIVI POL 303 


27.0344 


15 


SPAIFQSSMTKILEP 


HIV1 POL 335 


27.0345 


1 s 


PAIFQSSMTK1LEPF 


HIVI POL 336 


27.0349 


1 5 


QKLVGKLNWASQIYA 


HIV1 POL 437 


27.0350 


1 5 


VGKLNWASQJYAGIK 


HIVI POL 440 


27.0351 


1 5 


NREILKEPVHGVYYD 


H1V1 POL 485 


27.0353 


15 


1PEVVEFVNTTPPLVKL 


HIV1 POL 593 


27.0354 


1 5 


VSf^FVNTPPUVKLVVYQ 


HIVI POL 595 


27.0360 


15 


EQLIKKEKVYLAWVP 


HIV1 POL 705 
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Peptide A A Sequence 
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Class U Peptides 

Source 



27.0361 


1 5 


EKVYIAWVPaH kgig 


HIV1 POL 711 


27.0364 


1 5 


HSNWRAMASDFNLPP 


HIV1 POL 758 


27.0370 


1 5 


ASGYiEAEViPAETG 


HIV1 POL 822 


27.0372 


1 5 


AEHLKTAVQMAVFIH 


HIV1 POL 911 


27 .0373 


1 5 


KTAVQMAVRHNFKR 


HIVl POL 915 


27.0377 


1 5 


QKQTTKIQNFRVYYR 


HIV1 POL 956 


27.0379 


1 5 


KLLWKGEGAWIQDN 


H1V1 POL 082 


27.038 1 


1 5 


ENRWQVMIVWQVDRM 


HIV1 VI F 2 


27 0382 


1 5 


VEAJIRJLQQLLF1H 


HIV1 VPR 57 


27.0384 


1 5 


FNIWNSSIGUMVLS 


Pf CSP 413 


27.03B7 


1 5 


MNYYGKQENWYSLKK 


Pt CSP 53 


?7 03RS 


1 5 


MRKLAILSVSSFLFV 


Pf CSP 2 


27.0390 


1 5 


NSSIGUMVLSFLFL 


Pf CSP 417 


27-0392 


1 5 


SSVFNWNSSIGUM 


Pf CSP 410 


27.0393 


1 5 


MK1LSVFFLALFFII 


Pf EXP1 1 


27.0398 


1 5 


FILVNLUFHlNGK! 


Pf LSA1 11 


27.0400 


1 5 


HILYISFYFILVNLL 


Pf LSA1 3 


27 .0402 


1 5 


LL1FHINGKIIKNSE 


Pf LSA1 16 


27.0403 


1 5 


LVNLLIFWINGK1IK 


Pf LSA1 13 


27.0406 


1 5 


NLUFHINGKIIKNS 


Pt LSA1 15 


27.0408 


1 5 


GfTMFKSLLBNLGVSE 


Pf LSA1 94 


27.0412 


1 5 


AYKFVVpGAATPYAG 


Pf SSP2 514 


27.0415 


1 5 


NVKYLVIVFUFFDL 


Pt SSP2 6 


27.0417 


1 5 


VKNVIGPFMKAVCVE 


Pf SSP2 223 


27.0418 


15 


WENVKNVIGPFMKAV 


Pf SSP2 220 


1186.04 


1 5 


CSWRRAFPHCLAFS 


HBV POL 534 


1186.06 


1 S 


FVQWFVGLSPTVWLS 


HBV ENV 342 


1186.10 


IS 


LAQFTSAICSWRRA 


HBV POL 526 


1186.15 


1 5 


LVPFVOWFVGLSPTV 


HBV ENV 339 


11B6.18 


1 6 


NLSWLSLDVSAAFYH 


HBV POL 422 


1 1 86.25 


1 5 


SFGVWIRTPPAYRPP 


HBV NUG 121 


1186. 26 


15 


SPFLLAQFTSAICSV 


HBV POL 522 


11 86.27 


IS 


SSNLSWLSLDV5AAF 


HBV POL 420 


1 188.01 


1 5 


DKELTMSNVKNVSOT 


Pf LSA1 81 


1188.13 


1 s 


AGLLGNVSTTVULGGV 


Pf EXP1 82 


1188.16 


1 6 


KSKYKLATSVLAGLL 


Pf EXP1 71 


1 188.32 


1 5 


GLAYKFVVPGAATPY 


Pf SSP2 512 


1188.34 


15 


HNWVNHAVPLAMKU 


Pf SSP2 62 


1188.35 


1 S 


IGPFMKAVCVEVEKX 


Pf SSP2 227 


1 188.38 


1 5 


KYKIAGGIAGGLALL 


Pf SSP2 494 


1188.45 


1 5 


RHNWVNHAVPLAMKL 


Pf SSP2 61 


F091.15 


1 6 


IKORNMWQEVGKAMY 


HIVl ENV 566 


F107.03 


1 5 


LQSLTNLLSSNLSWL 


HBV POL 412 


F 107.04 


1 5 


PFLLAQFTSAlCSW 


HBV POL 523 


F1 07.00 


1 5 


KYKLATSVLAGLLGM 


Pf EXP1 73 


F107.10 


1 5 


LAG LJLG NVSTVLLGG 


Pf EXP1 81 


F1 07.11 


1 5 


RHPFKK2SSDPADNA 


Pf EXP1 107 


F107.14 


1 5 


ANGLW1LTDGIPDS 


Pt SSP2 153 


F107.17 


1 5 


KFWPGAATPYAGEP 


Pf SSP2 516 


F1 07.23 


1 5 


VFNWNSS1GLIMVL 


Pf CSP 412 


35.0093 


1 5 


VGPLTVNEKRRLXU 


HBV POL 96 


35.0096 


1 5 


ESRLWDFSQFBRGN 


HBV POL 387 


35.0100 


1 5 


LCaVFADATPTGWGL 


HBV POL 683 


35.0106 


1 5 


VWVATDALMTGYTG 


HCV 1437 


35.0107 


1 5 


TVDFSLDPTFTIETT 


HCV 1466 


35.0125 


1 5 


AETFYVDGAANRETK 


HIV POL 619 



2 
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Class II Peptides 

Peptide A A Sequence Source 



35.0127 

35.0131 

35.0133 

35.01 35 

35.0171 

35.0172 

1280.02 

1280.03 

1280.04 

1280.06 

1 2 80 .08 

1280.09 

1280.12 

1280.13 

1280.1 S 

1280.16 

1280.21 

1280.22 

1280.23 

1280.25 

1283.02 

1283.10 

1283.11 

12B3.12 

1283.13 

1283.14 

1283.16 

1283.17 

12B3.20 

1283.21 

1283.22 

1283.24 

1283.26 

1283.30 

1283.31 

12B3.33 

12B3.34 

1 283.36 

12B3.37 

1283.44 

1283.50 

1283.55 

1283.57 

1283.61 

1298.02 

1298.03 

1298.04 

1298.06 

1298.07 

129B.08 

1298. 10 

1298.1 1 

1298.13 

1298.16 

F125.02 

F1 25.04 



1 5 EVNIVTDSQYAU3U 

1 5 WAG1KQEFGIPYNPO 

1 5 GAWIQDNSDI KWP 

1 5 YRKILRORKIDRUD 

1 5 POSIQDSLKESRKLN 

1 5 KCNLYADSAWENVKN 

1 5 IGTYLVGPTPVNIIG 

1 5 KVYLAWVPAHKGIGG 

1 5 TKELQKQITKIQNFR 

1 5 AGFFLLTRILT1PQS 

1 5 GFFLLTRILTIPOSL 

1 5 GTSFVYVPSALNPAD 

1 5 HFLFILLLCUFLL 

1 5 KFAVPNLQSLTNLLS 

1 5 LHLYSHPil LGFRKI 

1 5 LLCU FLLVLLDYQG 

1 5 VGLLGFAAPFTQCGY 

1 5 FYFILVNLUFHING 

1 5 KSULRNLGVSENIFL 

1 5 RGYYIPHQSSLPODN 

1 £ VYLLPRRGPRLGVRA 

1 5 GHRMAWDMMMNWSPT 

1 5 CGFVYCFTPSPVWG 

1 5 VYCFTPSPWVGTTD 

1 5 GNWFGGTWMNSTGFT 

1 5 FTTLPALSTGUHLH 

1 5 SKGWRLLAPITAYAQ 

1 5 DLYLVTRHADVIPVR 

1 5 AQGYKVLVLNPSVAA 

1 5 GYKVLVLNPSVAATL 

1 5 VLVLNPSVAATLGFG 

1 5 QARLWLATATPPGS 

1 5 DVWVATDALMTGYT 

1 5 F7GLTHI D AHFLSQT 

1 5 YLVAYQATVCARAQA 

1 5 LEWTSTWVLVGGVL 

1 5 TWVLVGGVLAALAAY 

1 5 AKHMWNRSGIQYUA 

1 5 IQYLAGLSTLPGNPA 

1 5 MNRUAFASRGNHVS 

1 5 SYTWTGALITPCAAE 

1 5 GSSYGFQYSPGORVE 

1 5 LEUTSCSSNVSVAH 

1 5 ASCLRKLGVPPLRVW 

1 5 VGNFTGLYSSTVPVF 

1 5 TNFLLSLGIHLNPNK 

1 5 KQCFRKLPVNRPIDW 

1 S KQAFTF5PTYKAFLC 

1 5 AANWILRGTSFVYVP 

1 S PDRVHFASPLHVAWR 

1 5 IRPWSTQLLLNGSL 

1 5 RSELYKYKWKIEPL 

1 5 DRFYKTLRAEQASQE 

1 5 KVILVAVHVASGYIE 
1 7 LVNLUFHINGKIIKNS 

1 6 rhnwvnhavplamku 



HIV POL 674 
HIV POL 874 
HIV POL $89 
HIV VPU 31 
Pf SSP2 165 
Pf SSP2 211 
HIV POL 152 
HIV POL 71 2 
HIV POL 952 
HBV ENV 180 
HBV ENV 181 
HBV POL 774 
HBV ENV 244, 
HBV POL 406 
HBV POL 501 
HBV ENV 251 
HBV POL 637 
Pf LSA1 9 
Pf LSA1 98 
Pf LSA1 1669 
HCV Cor» 34 
HCV E1 315 
HCV NS1/E2 506 
HCV NS1/E2 500 
HCV NS1/E2 SS0 
HCV NS1/E2 684 
HCV NS3 1025 
HCV NS3 1134 
HCV NS3 1251 
HCV NS3 1253 
HCV NS3 12S6 
HCV NS3 1345 
HCV NS3 1436 
HCV NS3 1567 
HCV NS3 1591 
HCV NS4 1658 
HCV NS4 1664 
HCV NS4 1767 
HCV NS4 1777 
HCV NS4 1921 
HCV NS5 2456 
HCV NS5 2641 
HCV NS5 2813 
HCV N55 2939 
HBV POL 53 
HBV POL 568 
HBV POL 615 
HBV POL 661 
HBV POL 764 
HBV POL 824 
HIV1 ENV 333 
HIV1 ENV 637 
HIV1 GAG 333 
HIV1 POL 813 
Pf LSA1 13 
Pf SSP2 61 
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WHAT IS CLAIMED IS: 

1 . A composition comprising an isolated peptide that induces a CTL 
response and a T helper peptide comprising a motif of about nine residues wherein the first 
position from the N terminus of the motif is Y, F, W, L, I, V, M and the sixth position 

5 from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 

2. The composition of claim 1, wherein the T helper peptide consists of 
between about 10 and about 24 residues. 

10 3. The composition of claim 1, wherein the T helper peptide is derived 

from a viral antigen. 

4. The composition of claim 3, wherein the viral antigen is from HIV, 
HBV, or HCV. 

15 

5. The composition of claim 1, wherein the T helper peptide is derived 
from a parasite. 

6. The composition of claim 5, wherein the antigen is Plasmodium 

20 falciparum. 

7. The composition of claim 1, wherein the peptide that induces a CTL 
response is linked to the T helper peptide. 



25 8 . A method of inducing a CTL response in a patient, the method 

comprising contacting a cytotoxic T cell from the patient with an isolated peptide that 
induces a CTL response and a T helper peptide comprising a motif of about nine residues 
wherein the first position from the N terminus of the motif is Y, F, W, L, I, V, M and the 
sixth position from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 



30 
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9. The method of claim 8, wherein the step of contacting is carried out 
by administering to the patient a pharmaceutical composition comprising the nucleic acid 
encoding the peptide that induces a CTL response and the T helper peptide. 

10. The method of claim 8, wherein the the peptide that induces a CTL 
response is linked to the T helper peptide. 



11. A composition comprising a peptide as shown in Table VIII. 



12. A method of inducing a helper T cell response in a patient, the 
method comprising contacting a helper T cell with a peptide of claim 11. 

13. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising the peptide. 



14. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising a nucleic acid 
encoding the peptide. 
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